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Abstract: 

The distortion of images of faint, high-redshift galaxies by light deflection at foreground 
clusters of galaxies can be used to determine the (projected) mass distribution of the 
clusters. In the case of strong distortions, which lead to arcs in clusters, the position of 
the arc and/or its radius of curvature yields an estimate for the total mass inside a circle 
traced out by the arc. Weak distortions, which can be observed to much larger angular 
separations, can be used to determine the mass profile. In the case of weak distortions, 
an approximation which identifies the observed distortion with the shear produced by 
Q\ the lens can be made; this (linear) approximation breaks down, however, if one wants 

^^ to probe the center of the cluster, i.e., approach the region in which giant arcs can be 

^ formed. The methods devolped hitherto, the most advanced of which is that by Kaiser & 

*~^ Squires, rely on the linear approximation and can thus yield no reliable results for regions 

^ close to the center of the cluster. The purpose of this paper is to provide a theoretical 

basis to generalize the Kaiser & Squires method to the nonlinear regime, thereby making 
pq it more powerful, since the nonlinear distortions provide strong constraints on the mass 

CO profile, which in combination with the weak distortion should yield more reliable cluster 

r^ inversions. 

O We discuss the statistical properties of the observable image ellipticities and provide 

several methods to determine the local distortion of the lens from observed galaxy images, 

some of which rely only on the orientation of the images, not on their ellipticity. Ana- 

P_i lytic approximations for the error in the locally determined distortion are provided. An 

is invariance transformation of the density profile of the cluster is derived which leaves the 

^ observable distortion invariant; this transformation differs from that valid in the linear 

^ approximation where it is simply the addition of a homogeneous matter sheet. We then 

investigate the inversion of spherical clusters; in particular, we show the precision with 

which the center of a spherical cluster can be determined, both with and without usage of 

the absolute value of the image ellipticities. We show that inclusion of the absolute value 

of the ellipticities increases the accuracy of the center position considerably. Finally, we 

study the fully nonlinear inversion of spherical clusters in some detail, to illustrate the 

difficulties one has to anticipate in nonlinear reconstructions of realistic two-dimensional 

matter distributions. We compare the direct nonlinear inversion with the method of 

fitting parametrized mass profiles to the distortion data, and point out relative strengths 

and weaknesses. In particular, we emphasise the infiuence of the above-mentioned global 

invariance transformation and an additional local degeneracy. 



1 Introduction 

Since the first discovery of lensing in clusters (Lynds & Petrosian 1986, Soucail et al. 
1987) and the interpretation of giant luminous arcs as gravitationally distorted images of 
background galaxies (Paczynski 1987), which was later verified by the redshift measure- 
ments of arcs (e.g., Soucail et al. 1988), it was realized that these lensing events provide a 
unique means to study the mass distribution in clusters. Although a single luminous arc 
can only determine the mass enclosed in a circle traced out by the arc, the observation of 
several arclets (Fort et al. 1988) in the cluster A370 can be used to set more constraints 
on the mass distribution of this cluster (Grossman & Narayan 1989, Kochanek 1990). 
The pioneering observations by Tyson, Valdes & Wenk (1990) demonstrated that it is 
feasible to measure even weaker distortions of distant galaxies caused by a foreground 
cluster, and that these weak distortions can be used to trace the surface mass distribution 
of the cluster. A detailed study of these possibilities has been performed by Kochanek 
(1990) and Miralda-Escude (1991, ME91 hereafter), where the main emphasis was put on 
determining the profile of spherical clusters. Kneib et al. (1994) have tried to determine 
the redshift of several arclets in the cluster A370, for which a fairly detailed lens models 
has been constructed by Kneib et al. (1993), by calculating that redshift of the source 
which minimizes the intrinsic ellipticity of the source, given its observed image ellipticity 
and location in the cluster. For a recent review on cluster lenses, see Fort & Mellier 
(1994). 

Major progress then came through the work of Kaiser & Squires (1993, hereafter 
KS), in which an inversion formula for the mass profile of a cluster was derived and 
tested on numerical data and the observations of Tyson et al. (1990). Weak distortions 
have been verified around the double QSO 2345+007 (Bonnet et al. 1993), increasing 
the evidence for this system to be a gravitational lens system. The application of the 
Kaiser & Squires method to several clusters (Fahlman et al. 1994, Small et al. 1994a, b) 
have shown the principle potential of this method, but has also disclosed some of its 
weaknesses. For example, the direct reconstruction method as proposed in KS yields 
fairly noisy matter distributions. 

This is the first paper of a series in which we intent to study the problem of cluster 
inversion in more detail. In particular, the method of Kaiser & Squires applies specif- 
ically to the case of weak lensing, and thus the constraints from stronger distortions 
(i.e., arcs and arclets) cannot be easily merged with their inversion technique. More 
specifically, the method of KS is based on the fact that the shear distribution caused 
by a defiector depends linearly on the surface mass density; this linear relation can be 
inverted. However, the local shear cannot be obtained from image distortions; only in 
the case of weak lensing can the shear be identified with the local distortion. In the more 
general case which includes strong distortions, the relation between shear and distortion 
is nonlinear. One of the goals of this work is to find an inversion method which allows 
the inclusion of nonlinear effects. Furthermore, whereas the KS inversion equation is 
exact in the linear regime, its application to an observed sample of distorted images is 
burdened by several problems: observation of a finite number of galaxies introduces a 
shot noise to the inverted density profile. The prize for dealing with that noise is the 
finite resolution produced by a smoothing process. As we shall demonstrate in our next 
paper (Seitz & Schneider, in preparation), the smoothing procedure proposed by KS can 
be significantly improved. Second, the inversion formula requires data over the whole 



lens plane, whereas real observations are limited by the size of the CCD. In applying the 
inversion formula, an extrapolation of the distortion field into the unobserved part of the 
sky must be performed; equivalently, the surface mass density at the edge of the observed 
field must be specified. In either of these precedures, edge effects occur, which are most 
clearly visible in Fig. 7 of Small et al. (1994b). These edge effects can be reduced by 
choosing a 'reasonable' extrapolation of the distortion field. 

This first paper is confined to some basic considerations of the distortion of images 
by a foreground gravitational lens. In order to concentrate on the essentials, we will 
make the approximation that all sources (galaxies) have the same redshift, or that the 
distance ratio D^s/Dg (see Sect. 2) is the same for all sources. This assumption is not 
valid in reality, though it may be a fair approximation for lensing clusters at relatively 
low redshift. In Sect. 2 we provide the basic equations for the transformation of source 
ellipticities and image ellipticities and introduce our notation. Section 3 deals with the 
local observables, which mainly is the distortion, but not the shear. In fact, we shall see 
that even if the local surface mass density were known, the shear cannot be uniquely 
determined locally from the local distortion, but a degeneracy occurs. This degeneracy 
provides one of the main obstacles for the nonlinear cluster inversion. The local distortion 
can be determined from a sample of images; we present a method how this can be done in 
principle, and derive a relation between the local distortion and the mean image ellipticity, 
as well as its dispersion. Fit formulae for both quantities are presented which simplify 
their application. We also consider the statistical properties of the image alignments, 
and show that one can determine the local distortion also from the alignments only, 
without referring to the absolute values of the image ellipticities. This point deserves 
particular attention, since the observational determination of the image alignments is 
much less affected by observing conditions (most notably seeing) than the absolute value 
of the ellipticity. We then derive an invariance transformation of the surface mass density 
which leaves the distribution of the distortions invariant; this invariance transformation 
generalizes that obtained by Kaiser & Squires, valid in the linear regime, which is simply 
the addition of a homogeneous sheet of matter. In fact, the invariance transformation 
derived here corresponds to the 'magnification transformation' of Gorenstein, Falco &; 
Shapiro (1988). 

The next two sections deal with an application to spherical clusters. In Sect. 4 
we consider several methods for determining the center of a spherical cluster from image 
distortions and apply them to synthetic data. It is demonstrated that for critical clusters, 
the center can be determined much more accurately than for noncritical ones, if one 
makes use also of more strongly distorted images. In Sect. 5 we consider three fairly 
different methods for obtaining the density profile of a spherical cluster. We compare 
these methods, one of which is similar to the method of Kaiser & Squires specialized to a 
spherical cluster, whereas the other two are based on parametrized models, and discuss 
their relative advantages and potential drawbacks. Our results are briefly summarized 
in Sect. 6. 



2 Imaging of extended sources 

We can describe an extended source by its matrix of second moments: let Is{d) be the 
surface brightness distribution of the source, and let q{I) be a function of the surface 



brightness, we define the center of the source by (see Blandford et al. 1991) 

and quadrupole matrix of the source, 

^(s) _ /d^/3g(J.(/3))(A-A)(/3j-^j) .„ „. 

The function q{I) can be chosen appropriately; for example, it can be a Heaviside step 
function, in which case the shape of the source within a limiting isophote contour would 
be obtained; alternatively, q{I) can be chosen to give more weight to the center of the 
source which, however, can be strongly affected by seeing effects, or to the outer regions 
of a source which is probably less sensitive to seeing effects, but likely to be more noisy. 
The choice of q has to be tested on real data. As an example, we consider a source with 
elliptical isophote contours, centered on /3 = 0, 

/s(/3) = jff + f 



^a2 62 
with b/a < 1 being the axis ratio, j3 = TZ{'d)f3' , 

7e(^)=f^°^t """fl (2.3) 

y sm V cos V J 

describing the orientation of the image, such that the major axis encloses an angle -d 
with the positive /5i-axis, and J being an arbitrary function describing the profile of the 
source. Then, 

gj'^' = Qo (a^ cos^ 'd + b'^ sin^ 'd) , 

gg = <5S = <5o (a^ -fe^) COST? sin ^ , 

gg = Qq (a2 sin^ i9 + b'^ cos^ i}) , 

and 

_ JdttU{j{t-)) 

^° 2jdttq{J{t^)) 

describes the 'size' of the source. 

To describe the shape (including orientation) of a source, we can define a complex 
number x^^\ given by 

(gS-gg)+2igg 

The ellipticity x^^^ is given in terms of the eigenvalues Ai > A2 > of g*^^^ as 

where -dg is the position angle of the eigenvector belonging to the larger eigenvalue Ai. 
For a source with elliptical isophotes. 



(,) ^ 1 - {h/a? 2i^. 
^ 1 + {h/aY 

If a sufficiently small source is imaged by a gravitational lens, it can be described 
by the linearized lens mapping 

d/3 = A{e) de , (2.5) 

where 

A{e)=('-^-^^ 1 -^l ] (2.6) 

V ~'T2 1 - K + 7i y 

is the Jacobian matrix of the lens equation 

f3 = e-cx{e) . (2.7) 

Here, ct{0) is the (scaled) deflection angle of a light ray at position 0, given in terms of 
the surface mass density E{DaO) of the lens as 

cc{e) = 1 / dH' n{e') i^^ , (2.8) 

with the dimensionless surface mass density 

k{0) = 4^ (2.9a) 

and the critical surface mass density 

where D^g, Dg and D^ are the distances from the lens and the observer to the source, 
and from the observer to the lens, respectively. In terms of the deflection potential 

ij){e) = - f d^d' K{9')\n\e-e'\ , (2.10) 

we have 

«(0) = V^(0) , 

K{e) = \w^{e) , 

1 (2.11) 

ii{G) = -{i>,ii-i>,22) , 

72(0) = V,12 > 

where indices separated by a comma denote partial derivatives with respect to 9i. For 
details concerning these lensing relations, cf. Schneider, Ehlers & Falco (1992, henceforth 
SEE). 

Eor the image of a source, we can deflne in analogy to (2.2) the matrix of second 
moments Qij] due to the invariance of the surface brightness in gravitational light de- 
flection, it can be easily shown that the transformation between the quadrupole matrix 
of source and image is given as 

Q^'^ = AQA , (2.12) 



where A is the Jacobi matrix of the lens equation, as defined in (2.5). Defining in analogy 
to (2.4a) the ellipticity % of the image, the transformation between x'^' and % is obtained 
from (2.12) as 

(s) ^ 2(1-k)7+(1-k)^x + 7V .2 13) 

(l-K)2 + |7|2 + 2(l-K)7^e(7x*) ' 

where 7 = 7i + i72 is the complex shear, and an asterisk denotes complex conjugation. 
It can be seen immediately that the transformation between x'^' and x does not depend 
on K and 7 separately, but only on the combination 

^ (2.14) 



^-(1-.) ' 
with which (2.13) becomes 



X^-^= ^ .''^^!/^\... • (2.15a) 



l + \g\' + 2'Re{gx 
The inverse transformation is 

l + \g\ - 27^e (^fx^ •' ) 

We note that the dimensionless surface mass density k, and thus the lens parameter g, 
depends on the redshift of the lens and the source. 



3 Local observables 



3.1 Basic considerations 

All cluster inversion methods are based on the assumption that the sources are oriented 
randomly on the sky, so that any net orientation of images is due to the tidal efi'ects of 
the defiecting mass distribution. Given the validity of this assumption, the determining 
equation simply reads 

>')=0 , (3.1) 



where the average is taken over an ensemble of sources. Throughout this paper we 
assume that all sources are at the same redshift, so that the lens parameter g is the 
same for all sources; this assumption is not a very realistic one in general, but may apply 
approximately for clusters with low redshift. Then, if we assume that g varies little over 
a region in the lens plane in which TV images are seen, we obtain an estimate for g by 
solving the equation 

[x^^[9])^j^Y.^^\9] = , (3.2) 



where we have explicitly written the dependence of the function x from (2.15a) on g. 
However, (3.2) does not have a unique solution g, as can be seen from the relation 



x<^'[^] = 4x*^'*[V^1 ; (3.3) 

hence, if 5^ is a solution of (3.2), so is \l g* . One therefore defines the (complex) distortion 

2g 2(1 - k)j 



l + \g\^ (1-k)2 + |7|2 ' 



(3.4) 



which is unvariant under the transformation g -^ l/^f*, and tries to find an equation for 
determining 5. Such is obtained by considering 

X+ ■ X [91 + X 1^/9 J i + ne{Sx*) 

W (s)r 1 (s)ri / *i (^i-\9f\ X-X*_S/S* 

X'J :=%' \9]-x' \l/9 ] = \j^') l + ne{Sr) ' 

hence, the combination 

depends solely on 5, and since (x'^'f^f]) and (x'^'fl/if*]) both have to vanish, the desired 
relation for determining 5 becomes 

iv(c) = y — ^i , = . (3.6) 

Instead of searching the root of equation (3.6), one can solve for 5 by the iteration 
equation 

and start the iteration with the linear approximation 

5o = - (x) . (3.8) 

In fact, the iteration quickly converges to a unique solution. Note that \5\ < 1. In terms 
of 5, the complex shear becomes 

.= (i^fl±Ar^l . (3.9) 



where the '+'-sign (' — '-sign) applies for (1 — k)^ < I7I ((1 — k)"^ > \j\ ), or det^ < 
(det A > 0), respectively. 

It is instructive to investigate the behaviour of x under the transformation g -^ 
1/g*. The components of x'^' parallel and perpendicular to 5 are 

(,) ^ -Re (x(-)^*) _ 1 (,,, ^ 7^e (Sx*) 

Xll — is-i — -, , -n_ /r. \ rl + 



l + ne{dx) V 1^1 

l + l^l 



(,)_Xm(x(-)^*) Xm(^*x) 1 f 1 - \9\'^ ^^'^^^ 



^^ \5\ l + neiSx*) \S^ * - ' '-'2 



Prom these relations, we see that the component parallel to 5 remains unchanged, and the 
component perpendicular to 5 changes sign, under the transformation g -^ 1/fi'*- In fact, 
as long as we consider all sources to have the same redshift, the degeneracy between g and 
1/g* cannot be broken by any local measurement. To see this, we define the probability 
distribution Ps (x'^') such that Ps (x'^-*) d^x*'^' is the probability that the source has an 
ellipticity within d^x'^' of x'^'- Due to the assumed isotropy of the source orientation, 
this probability distribution can depend only on x i furthermore, Ps is normalized. 
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(3.11) 



The probability distribution p(x) of the images is then 



P(x) =Ps ( X*"' {X,9 



d^X 



from (3.10) one finds that 



X 
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{i-\sf)(i-\x\') 



and the Jacobian of the transformation is readily evaluated; hence. 



(3.12) 



P{x) = Ps 



1 


, (-^^)(-x^)] 


[l + 7^e(5x*)]' 
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,2n2/2 
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[l + 7^e(5x*)]' 



(3.13) 



and we see that this probability distribution depends solely on 5, and is thus invariant 
under the transformation g -^ '^jg* ■ Thus, the best one can hope to measure from the 
local distribution of galaxy images is the value of 5. 

3.2 Statistics of single images 

From a single galaxy image, or its ellipticity Xi can one obtain some estimate (however 
uncertain) of 5, or can, for an assumed value of 5, a measure be given for the deviation of 
X from its expectation value? Unfortunately, the answer to the first question is negative: 
the way one would estimate the expectation value of 5 is to invert (2.15) and write ?> as 
a function of x and x'^'; then, an expectation value for 5 could be obtained from the 
probability distribution p^. However, for given x and x'^', there are in general two values 
of 5 which satisfy the transformation equation (2.15). Thus we are left with the second 
of the above mentioned possibilities, to determine the statistical properties of x for a 
given value of 5. 

The reason why one is interested in these statistical quantities is that of fitting a 
mass model to observational data. Any mass model predicts 5 as a function of position 
in the lens plane; this means that for any given galaxy image, one can compare the 
observed value of x with the local expectation value (x) and define a local error measure 
Ix — (x)l /<^^(x)) the parameter of the lens model can then be chosen as to minimize the 
sum of the local errors. Here, cr(x) is the dispersion of x, which is a function of |5|. The 



functions (%) and cr(x) depend, besides on 5, also on the intrinsic distribution of source 
ellipticities. Hence, the usefulness of these measures depends on the degree to which this 
distribution function can be obtained from observations. 

The expectation value (%) of % must have the same phase (or 'direction') as 5; we 
can therefore consider the special case of 82 = Xm(5) = 0, which implies, from (3.4), that 
g2 =Im{g) = 0. The transformation (2.15) then becomes 



X2 = iMx) 



X2 



1-9' 



(3.14) 



1-5x^1+^^ 



with 5 = 5i, g = Qi. The expectation values can now be calculated from the probability 
distribution Ps of the source ellipticities; after performing the angular integration, we 
find 



{x.) = 2.j^AyyMy-^j=^^-\ 



f 

Jo 

(X2)=0 , 

(Xi> = 27r / dy yps{y) 
Jo 

(X1X2) = , 



1 - 52 /■! 

(xi) = 27r ^^ / dy yp^iy) 



1 - S^yY - \/l - 52y2 (1 - 54 - 252^2 + 25^2/2) 



52(1-52^2)^ 
:i-^V)^-v/r^^7V (1-2^2^2) 

(l-52y2)2 



For an arbitrary phase of 5, we define the function C (1^1) by 

(X) = -C(|5|)5 , 
which in our case (^2 = 0) becomes ({S) = — (xi) /S. The dispersion of %, 



(3.15) 

(3.16) 



a. 



Ixl 



\{x)\' = ^J{xi) + {xi)-{xiy 



(3.17) 



where the final expression applies to our special choice 82 = 0, is invariant against 
rotations of 5. For any given choice of the distribution function Ps, one can now calculate 
C(|5|) and the corresponding dispersion. 

Expanding the integrands in (3.15) in powers of y, we can obtain an expansion of 
the mean values in moments of the ellipticity distribution function, defined as 



Mn = 27T f d2/y("+i)p,(y) ; 
Jo 



(3.18) 



we then obtain: 



as) = i-{i-5') 

(Xl) = (1 - 5') 
so that 
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Fig. 1. The function C(^)i defined in (3.16), for an ellipticity distribution of the form (3.21), for several 
values of the width R. The solid curves show the exact value of C(^)i whereas the dotted curves show 
the approximation C{^): see (3.22). It can be seen that ^ provides a good approximation for all values 
of 5 and R. The curves are labelled by the value of R 



To illustrate the preceding calculation, we consider a specific distribution of source 
elhpticities, of the Gaussian form 



Ps(|x<^'|) 



c^'^'/R' 



TTJ 



so that R describes the width of the distribution. Its first few moments are 



(3.21) 
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Fig. 2. The dispersion a^ as a function of 5, for an ellipticity distribution of the sources given by 
(3.21), for several values of R. The solid curves show the exact values of u^, the dotted curves are the 
approximation a-^^, given by Eq.(3.23). The curves are labelled by the value of R 

In Fig. 1 we have plotted the function <^(5) (solid curves) for several values of the 
width R. For 5 ^ 1, C ^ 1, as can also be seen directly from the the definition (3.16) of 
C, and the expression (3.15) for (%). For smaller values oi 5, C, < 1, which implies that the 
linear relation 5 = — (%) - see (3.8) - underestimates the true value of 5. The deviation 
of C from unity, and thus the error made by using the linear relationship, depends on 
the width of the ellipticity distribution. Fig. 2 displays the dispersion a^ as a function 
of 5, again for several values of the width R of the distribution (3.21). For 5 ^ 1, a^ 
vanishes - all sources (formally) attain axis ratio zero if 5 -^ 1, independent of their 
intrinsic ellipticity. As expected, the dispersion depends sensitively on the width of the 
distribution p^. Analogous calculations can be found in ME91. 

Since we shall use the functions C and a^ extensively below, it will be useful to find 
an approximation for these function. It turns out that a useful approximation is 
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where the parameters have been fixed by requiring that C = C fo^" S = and 1, and that 
the second derivative of both functions at 5 = agree. Correspondingly, 
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a^ = A{l-5') 
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and the parameters have been fixed by the same requirements as those for C,. In Figs. 1,2, 
the functions C and a^ have been plotted as dotted curves; it is seen that they agree 
well with C and a^, respectively, except if one comes close to 5 ~ 1. The special form 
of the approximations (3.22 & 23) may be due to the specific choice of the intrinsic 
ellipticity distribution (3.21), but similar functional relationships are expected also for 
other distributions pg. 

The measurement of the absolute value of % is affected by seeing, noise, tracking 
problems and other effects. As argued by Kochanek (1990), one should try to use only 
the orientation of % to determine the cluster mass profile, since it is less affected by 
observational problems that % itself. We therefore consider next the statistical properties 
of the orientation of %. 
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Fig. 3. The probability distribution for the angle 'ip between the direction of the minor axis of the equiva- 
lent image ellipse and the local direction of the shear. The intrinsic ellipticity distribution (3.21) has been 
assumed, the solid curves are for R = 0.3, the dotted curves for R = 0.5. The distributions are displayed 
for \5\ = 0.05,0.1,0.15,0.2,0.25, with the smaller values corresponding to the flatter distributions 



For this, we define -0 to be the angle between the minor axis of the equivalent 
image ellipse, and the local direction of 5: if we write % = (%) e^'^ and 5 = |5| e^"^, then 
ip = 9 — (f) — n/2. In particular, for an intrinsically round source, '0 = 0. The probability 
distribution for if) is obtained from (3.13) to be 
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(3.24) 
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which is normalized on ip E [—n/2,n/2]. In Fig. 3, we have displayed the function 
Pa.{ip), assuming the distribution (3.21) for the source ellipticities, for two values of R 
and several values of 5. Since Pa{ip) is an even function of -0, we have plotted only the 
interval ip G [0, n/2]. As expected, the larger the value of i?, and the smaller S, the flatter 
is the angular distribution function. 

The mean value {ip) = 0; this is equivalent to saying that the mean orientation of 
the images immediately yields the local orientation of the distortion, which in turn agrees 
with the local orientation of the shear. Information about the strength of the distortion 
is contained in moments of the angular probability distribution. We define 



(cos 2-0) = / dip pg,{tp) cos2'0 

J—7r/2 

and the corresponding dispersion 



(3.25a) 



o"(cos2'0) = v/(cos2 2'i/') — (cos2'i/')" 



(3.256) 



and display both quantities in Fig. 4. The average (cos 2ip) increases linearly for small 
5, with a slope depending on the intrinsic ellipticity distribution, and approaches unity 
for 5 ^ 1, as expected. The corresponding dispersion is l/-\/2 at 5 = and vanishes as 




Fig. 4. The mean value (cos 2^/;) (solid curves) and the corresponding dispersion a{cos 2xjj) (dotted curves) 
as a function of 5, for the choice (3.21) of the intrinsic ellipticity distribution. The curves are labelled 
by the value of R 

Unfortunately, there is no simple expression for (cos 2^) or a in terms of moments 
of the intrinsic ellipticity distribution. For later reference, we list here the lowest order 
term for the intrinsic distribution (3.21), 
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(co.2^) = ^lf_f%f^^ S + 0(S-) , (3.26) 

where erf (a;) is the error function. Note that (cos 2^) is very insensitive to 5 once this 
quantity approaches unity; this property will be of some relevance in the next subsection. 

3.3 Local determination of the distortion 

In the previous subsection we have considered the statistical properties of the image 
ellipticity for a given distortion; here we consider the problem to determine a value of 
the distortion from a given set of image ellipticities. The situation we have in mind 
is to have a set of N galaxy images with measured ellipticity Xi i^ ^^ area which is 
small enough to assume that 5 is appoximately the same for all images. How can 5 be 
determined? 

We consider basically two methods to estimate S. The first method (hereafter 
Method la) has already been indicated in Sect. 3.1, namely to solve (3.6), the condi- 
tion which expresses the intrinsic isotropy of the source distribution. The equation is 
efficiently solved by the iteration (3.7), which after three to five iterations yields the cor- 
rect solution of (3.6) with an accuracy of better than 10~^ (i.e., much better than one can 
ever hope to need). A slight variant of this method is provided by applying the relation 
(3.16), i.e., to determine the mean ellipticity (%) from the images and to solve (3.16) for 
5, either using the exact form of C, or the approximation ^, as given in (3.22) (hereafter 
Method lb). The advantage of Method la is that the intrinsic ellipticity distribution 
need not be known, whereas for Method lb, the function Cd^l) depends on the intrinsic 
ellipticity distribution. 

A different method is provided by the statistical properties of the image orientations 
only, i.e., without using the absolute values of the Xi- The phase (or direction) of the 
distribution is given by the relation (■0) = 0, or more practically, (sin2f/') = 0. This 
yields for the direction (f) of the distortion 

(sin2e) 

tan2(^=^^ -f . (3.27) 

^ (cos 2^) ^ ^ 

Then, the amplitude of the distortion is determined by calculating (cos 2^) and (numer- 
ically) inverting the function (cos 2ip) (5) plotted in Fig. 4. We henceforth refer to this 
method as Method 2. 

We have run simulations in which N sources were distributed, according to the 
intrinsic distribution (3.21), and mapped through a distortion 5. From the images, i.e., 
the Xi, we have determined the estimates 5'^', with x = la, lb, 2, with each of the above 
mentioned methods. Repeating the simulation, we have determined the rms error 



a^-) = J (\5i-) - 5\ ) (3.28) 



for each method, which depends on the distortion 5. These errors are displayed in Fig. 5. 
From the statistical results for single images, we can obtain a simple approximation 
of the rms error corresponding to Method 1. Assuming Poisson statistics, the expected 
error is a'-*^' = a^/y/N, where a^ is defined in (3.17) and plotted in Fig. 2. The dotted 
curves in Fig. 5 correspond to this Poisson error, with a^ replaced by a^, as given in 
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Fig. 5. The rms error of the determination of the distortion, as defined in (3.28). These curves were 
determined from 10 simulations for each value of 5, with N = 10, 20, and 40 sources in each case. The 
intrinsic ellipticity distribution (3.21) was assumed, with R = 0.3. The solid curves show the error a^ ' 
as obtained from Method la, short-dashed curves show u' ', and long-dashed curves show a^ ' . The 
dotted curves show the analytic estimate of the error for Method 1, assuming Poisson statistics (see text) 

(3.23). We see that this analytical estimate yields a very accurate description of the true 
error of Method 1. The small, but systematic deviations of the analytical estimate from 
the true error are due to the fact that a^ describes the dispersion of (%) for a given 5, 
which is not the same as the dispersion of 5 for a given (%) . 

We see from Fig. 5 that in the 'linear' regime, 5 <C 1, the determination of 5 from 
Method 2 is not very inferiour to that of Method 1, whereas the error is very much larger 
in the nonlinear regime (5 <^ 1). The origin of this large error in the nonlinear regime 
can be seen from Fig. 4. Since the amplitude of 5 is determined from inverting (cos 2i/)) 
as a function of 5, and since this function becomes very fiat in the nonlinear regime, the 
amplitude of 5 can only be determined crudely. This, however, does not necessarily imply 
that the determination of S from image orientation alone is precluded. It is likely that 
one can define statistical quantities from the angular probability distribution function 
(3.24) which are more sensitive functions of 5 in the nonlinear regime. For example, one 
could use the function ^1 — 4(ip — 7r/2)^/7r^) to determine 5 in the nonlinear regime, or 
make use of the full angular distribution Pa(V')j we hope to return to these issues in a 
later paper. 

From the numerical simulations, we can also obtain the distribution of estimated 
values of 5, p{5,5tiue), where Stme is the true value of 5 with which the galaxy images 
were imaged. In Fig. 6 we have plotted the contours of equal proability p(5, 5true)> for 
Method la and 2. This figure illustrates that the estimated values are isotropically 
distributed around the true value if 5true is in the linear regime, but anisotropics of 
the error occurs for larger distortions. For Method 1, these anisotropics are oriented 
such that the error in the 'radial' component of 5 is smaller than that in the tangential 
direction; this decrease of the error in the radial direction is directly related to the fact 
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Fig. 6. Contours of equal probability p{5, (5true) are plotted, for i5tiuc = 0.15(1 + i) (Panel a) and (5true = 
0.5(1 + i) (Panel b). Solid contours correspond to values of 5 determined by Method la, dashed contour 
to those obtained by Method 2. The levels show contours of 10%, 20%, . . . , 90% of the peak probability 

that the error of the absolute value of S decreases with increasing \5\, a fact which is 
illustrated by the decrease of the function a^ towards 5^1. In contrast, the error in 
the determination of the direction of 5 from Method 2 is about the same as that from 
Method 1, but the error of the absolute value of 5 is much larger than that for Method 
2, a direct consequence of the flatness of the function (cos2'0) for 5^1. 

3.4 An invariance transformation 

Can one reconstruct the surface mass distribution k{0) from the observable image dis- 
tortion 5{0)1 This question has been investigated by KS in the linear regime, i.e., for the 
case that one can set 5 ^ 2g ^ 27. They have shown that the finite region of the sky in 
which the distortion is measured leads to a free parameter in the reconstructed surface 
mass density, namely an overall additive constant, which is related to the surface mass 
density at the 'edge of the frame'. Here we show that a similar invariance transformation 
exists in the fully nonlinear case. 

Let S{0) be the observed distortion field, and suppose that the mass distribution 
no{0) is an inversion of the profile, i.e., no{6) satisfies the relation (3.4). Then, the 
one-parameter family of mass profiles 
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K{e;X) ■=!- X + Xko{0) (3.29) 

also yields an inversion of (3.4). This transformation corresponds to an additive constant 
of the surface mass density, plus a related scaling of kq, and is similar to the 'magnification 
transformation' pointed out by Gorenstein, Falco & Shapiro (1988; see also Sect. 13.1.1 
of SEF). The proof of the validity of this invariance property is simple: first, the trans- 
formation (3.29) yields (1 — k{0)) = A(l — ko{0)). Second, the constant term in (3.29) 
leads to a linear term in the deflection angle, so that a{0) = (1 — X)6 + Aq:o(^), where 
ao{0) is the deflection angle corresponding to kq. Calculating the Jacobi matrix (2.6) 
from this deflection law leads to the result that 7 = A70. Hence, both the numerator 
and denominator of (3.4) are multiplied by A^ under the transformation (3.29), so that 
S remains invariant. 

In the linear case, |A — 1| <C 1, and the transformation (3.29) indeed corresponds 
to adding a constant surface mass density, in agreement with KS. The exact nonlinear 
transformation is only slightly more complicated. It should be noted that the transfor- 
mation (3.29) leaves the critical curves unchanged; in addition, the curves of k{0) = 1 are 
unchanged. In general, the transformation of the magnification is ii{0, A) = X~^ iJ,o{0). 

In practice, one can obtain constraints on A; for example, since the surface mass 
density cannot be negative, A must be chosen such that k > everywhere. Other kinds 
of possible constraints are to require that k behaves like a power law of |0| for large values 
of \0\. For the case of a spherical cluster, we will discuss this invariance in somewhat 
more detail in Sect. 5 below. 



4 Spherical clusters 

As an illustrative application of our previous results, we want to determine in this section 
the special case of a cluster with axially-symmetric matter distribution, i.e., k{0) = 
ii{\0\)- In Sect. 4.1 we discuss the theoretical behaviour of the distortion in spherical 
clusters and choose a particular parametrized lens model. The determination of the 
cluster center is discussed in Sect. 4.2, where we also investigate the minimum mass a 
cluster must have to produce a signal in the image distortion. In the next section, we then 
discuss several methods to reconstruct the density profile from observed galaxy images. 

4.1 The distortion 5 

The direction of the shear in a spherical cluster is always radial: at a position = 
{9cos(p,9sm(p), the complex shear is 71 + 172 = -Te^"'', where r{9) is a real function, 
not necessarily positive, which depends only on the angular separation 6 of the point 
considered from the cluster center, chosen here to be at = 0. In the case of a spherical 
cluster, the defiection angle (2.8) reduces to (see SEF, Sect. 8.1) 

a{e) = ^=:lj\eeK{e) , (4.i) 

and the direction of a is colinear to that of 0; in the second step of (4.1), we have defined 
the dimensionless mass m{9) inside a circle of radius 9. To determine the function r{9), 
it is easiest to consider the Jacobian matrix A{0) on the ^i-axis: the matrix element An 
there becomes 
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so that 



da 
Aii = l-K + r = l- — 

ad 



r{e) = ^-K{0)=:R{e)-K{e) , (4.2) 



where in the second step we have defined R{9), the mean dimensionless surface mass 
density inside a circle of radius 9. For centrally-condensed mass distributions, k' = 
d/^/d^ < 0, K > K, and thus i^ > for all 9. The inversion of (4.2) can be easily obtained 
by multiplication with 9^ and differentiation: 

r°° c\f)' 

K{9) = -r{9)^2J^ —r{9') , (4.3) 

where the condition that k ^ and -T ^ as ^ ^ oo has been incorporated in the upper 
integration limit. Equation (4.3) can also be derived from the inversion formula of KS, 
if the latter is specialized to symmetric matter distributions. This relation provides the 
basis of cluster inversion methods: if the shear r{9) can be determined observationally, 
(4.3) can be used to reconstruct the density profile. There are two principal problems 
the method is faced with: observations will lead to a distribution of galaxy images in a 
limited solid angle around the cluster, whereas the integral in (4.3) extends to infinity; 
hence we expect that k{9) can be determined from observations of a limited field only up 
to some transformation (we shall show below how this transformation looks like in the 
spherical case). Second, as we have discussed in Sect. 3.1, the shear is not an observable; 
what can be observed is the distortion 5 - see (3.4) - which is related to the shear and 
the local surface mass density. Hence, whereas (4.3) is a linear relation between F and k, 
the relation between 5 and k is a nonlinear one. In Sect. 5 we shall discuss how a cluster 
inversion can be performed in the nonlinear case. 

We can now discuss the qualitative behaviour of the distortion 



(i-K)2 + r2 

as a function of 9, for 'typical' mass profiles. As such we consider profiles k{9) which 
are finite (k ^ for ^ ^ oo), centrally condensed {dK/d9 < 0), and which have at most 
two critical curves, i.e., circles at which the determinant detA of the Jacobi matrix (2.6) 
vanishes (note that, in contrast to common belief, even a centrally condensed symmetric 
lens can have more than two critical curves, but in order for this to occur, the mass 
profile must be fairly special). Prom the discussion in SEF, Sect. 8.1.1, we see that 
critical curves occur if, and only if, the central surface mass density k(0) > 1. The radius 
of the tangential (radial) critical curve will be denoted by 9t (9^), with 9^ < 9t, and is 
given by 

R{9t) = 1 ; 1 + R{9r) - 2k{9^) = . (4.5) 

Alternatively, at the critical curves, 1 — k ± -T = 0. Furthermore, ^(^t) < 1, n{9^) > 1. 
Together with the property that i^ > for centrally condensed lenses, this implies that 
5 = +1 {5 = —1) at the position of the tangential (radial) critical curve, and \5\ ^ 1 
elsewhere. Hence, if k{0) > 1, 5 will decrease from to —1 as 9 increases from to 9j-, 
then increase from —1 to +1 for 9^- < 9 < 9ti becoming zero at the point where k = 1, 
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and decrease towards as ^ ^ oo. If k{0) < 1, 5 remains positive but smaller than +1 
for all 6. Correspondingly, the function 

r 



9 



(l-K) 



(4.6) 



takes values in the range < g{6) < 1 if k{0) < 1, whereas for ^(O) > 1, g decreases 
from to —1 as 6 increases from to 9j-, decreases towards — cx) when k{9) = 1, and then 
decreases towards zero as ^ ^ oo, with ^f = 1 at ^f For a mass profile of the form 



K 






3/2 



(4.7a) 



so that k{0) = x^^, with 



X 



(4.76) 



n ' ^c — ^ , 

where 9^ is the angular radius of the core of the mass distribution, and ^o is a characteristc 
angular scale, we have plotted the functions 5{x) and g{x) in Fig. 7. 
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Fig. 7. For the mass profile (4.7), the distortion 5 and g are plotted for two different values of the core 
radius ^c- For a;c = 0.5, this lens models has two critical curves, the location of which are indicated by 
Xt: and xt (also in units of 6q) 

Note that positive (negative) values of 5 imply that the image of a round source 
is aligned tangentially (radially) with respect to the cluster center. At a point 6 = 
\0\ (cos (f>, sin (f)), we define the tangential alignment Xt of an image with complex ellipticity 

X 



9ii' 

IX I e as 



Xt := -ne (xe-2'^) 



Ixl cos [2{'d 



|x|cos(2V') 



(4.8) 



where, as before, t/j = -d — cp — 7r/2 is the angle between the radial direction to the cluster 
center and the minor axis of the image. 
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4.2 Determination of the lens center 

Given a set of observed galaxy images mapped through a foreground cluster, one can 
determine the center of this cluster from the observed image ellipticities. In this subsec- 
tion, we investigate several methods for this center determination, and also investigate 
the question of how many galaxy images one needs to detect a signal of lensing to occur. 
Consider first the case that the cluster is not critical, i.e., that k{0) < 1, so that no 
critical curves occur. In that case, 5 > for all 9, so that one expects a net tangential 
alignment at all radii. For each 'test center' X, one can define the function 

N 

Ci(X) = J^cos2^i(X) , (4.9a) 

i=i 

where the sum extends over all images, and ipi(X.) is the angle between the direction 
of a galaxy image to the test center and the minor axis of the image. One expects this 
function to have a maximum at the true center, chosen here to be at X = 0. In the 
function Ci, each galaxy image attains the same weight, independent of the absolute 
value of its ellipticity. One might expect that at radii where the distortion is strong, and 
thus the image ellipticities (or the tangential alignments Xt) are large, the local direction 
to the center is better constrained, and thus define 

N N 

C2{X) = Y,Xti = Y.\Xi\cos2xlji{X) . (4.96) 

i=l i=l 

If we knew the mass profile of the cluster, we could also calculate the function 

N 

C3(X) = 5]5(x,)cos2V'i(X) , (4.9c) 

i=i 

which should be less sensitive to statistical fluctuations than (4.9b): an intrinsically 
strongly elongated source would be strongly weighted by C2, though the direction of its 
image will hardly be affected by a weak distortion, whereas the function C3 would give 
little weight to such an image. Of course, the mass proflle is not known a priori, but as 
will be shown in the nect section, it can be reconstructed, and from this reconstructed 
proflle the function 5{x) can be calculated and used in the calculation of C3. 

If the cluster is critical, i.e., k{0) > 1, those galaxy images for which k > 1 will 
cause a negative contribution on average to the sums in (4.9a&b), since they are radially 
elongated. We therefore deflne, in analogy with (4.9a-c), the following three functions 

N 



C4(X) = ^cos4^i(X) , (4.10a) 

i=l 

N 

C5(X) = 5^|x/cos4V'.(X) , (4.106) 

i=l 

N 

Ce{X) = Y,S\^,)cos4ijj,{X) . (4.10c) 
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It is easily seen that both, tangentially and radially aligned images will contribute posi- 
tively to the sums in (4.10a-c), so that one might expect that these are better measures 
to determine the cluster center than the functions (4.9) in the case of critical clusters. 
One way to visualize the difference between the functions Ci, C2 and C4, C5 is by noting 
that cos 2ipi is a function which is positive on the wedge centered on the minor axis of 
the galaxy image, whereas it is negative on the wedge along the major axis; in contrast, 
cos 4:ipi is positive on all four wedges centered on the axes of the images, and negative in 
between (see also Kochanek 1990 for a similar geometric argument, applied to the arclets 
in A370). With this picture in mind, one would expect that the functions (4.10) yield a 
better 'resolution' of the cluster center than those in (4.9). We shall see below that this 
expectation is only partly justified. We have performed numerical simulations for the 
application of the foregoing methods; these simulations will be briefly described next. 

We simulated cluster mass distributions of the form (4.7); note that this profile 
behaves like an isothermal sphere for x ^ Xc, k, ^ l/(2x), and so the characteristic 
angular scale 60 can be parametrized by the velocity dispersion cr„ of an isothermal 
sphere, 

«« = 4.(^)7%) ■ (4.11) 



c J \ D 



For all simulations presented in this paper, we have set the distance ratio D^s/Dg = 
0.6. Galaxies were randomly placed on the lens plane with a density Nq per square 
arcminute. In principle, one would have to distribute the galaxies randomly in the source 
plane; however, the source counts of faint galaxies have a alope of close to —1, so that 
the magnification effects are just compensated by the corresponding enlargement of the 
solid angle. Each galaxy was then assigned an intrinsic ellipticity x^^\ according to the 
distribution (3.21). Using the local values of k and 7, these galaxies were imaged, i.e., 
their ellipticity % was calculated. 

Using such simulations, we have determined estimates for the cluster center by max- 
imizing the function Cj(X), i = 1 through 6. In Fig. 8 we have plotted the probability 
of the deviation of the estimated cluster center from the true center, measured in ar- 
cminutes, for critical and non-critical clusters, for a circular field of galaxy images with 
radius 2'.5 and a galaxy density of A^o = 16; i.e., about 316 galaxies were used for the 
determination of the cluster center. For the function C3 and Cq, the known distribution 
of S was used. It can be seen that the distribution of the deviations of the estimated 
cluster center from the true one is quite different for the six different functions Ci. In 
fact, for the non-critical clusters, shown in the upper panels of Fig. 8, the functions C4, 
C5 and Cq are consistently more inaccurate than the functions Ci through C3, containing 
cos2ipi, whereas for critical clusters, the functions C4 through Cq yield more accurate 
results than those of (4.9). In all cases, the tendency that C3 is more accurate than C2, 
which in turn is more accurate than Ci, is present, and the same is true for Ce, C5 and 
C4. However, the gain in accuracy by changing from C2 to C3 (and from C5 to Cq) is 
not dramatic, and therefore not really worth the effort, given the additional uncertainty 
in practice about the functional behaviour of 5(x). ME91 estimated that the precision 
with which the cluster center can be determined is of the order of the mean separation of 
galaxy images; this estimate was based on the simplifying assumption that one uses only 
those parts of the cluster outside the critical curves. In our case, the mean separation of 
galaxy images is 0'.25. The median value of the deviation of the estimated cluster center 
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Fig. 8. Determination of the cluster center from image distortions. Plotted is the probability P{AO) 
that the error of the estimated cluster center is larger than A6, measured in arcminutes. For these 
simulations, a field of radius 2'.5 was chosen, with mean density Nq = 16 per square arcminute, i.e., the 
expected number of galaxies per field is 314. The intrinsic ellipticities of the sources were drawn from 
the distribution (3.21), with R = 0.3. The different panels show this probability for different cluster 
parameters, described by the velocity dispersion a^ and the core radius a;c- The upper panels correspond 
to clusters which are too weak to produce critical curves, whereas the lower panels are for critical clusters 
with k{0) > 1; note the different scales in the upper and lower panels. The center was estimated for 
1000 realizations of the galaxy distribution, by maximizing the functions Ci(X) of (4.9 & 10), and the 
different curve correspond to different functions Ci(X), as indicated at the top of the figure 



from the true one is about O'.OS as determined by tlie functions in (4.9) for the non- 
critical clusters, quite independent of a„, in rough agreement with the estimate of ME91. 
However, this median deviation becomes considerably smaller for the critical clusters, 
of order 0'.02 for the critical clusters, if determined by the functions of (4.10). This is 
only one tenth of the mean separation of galaxy images. We therefore conclude that the 
cluster center can be very accurately determined (for assumed spherical clusters) if the 
cluster is critical and thus capable of producing a few fairly strongly distorted images. 

Without determining the mass profile of a cluster, it is not known whether it is 
singular or not; hence, one would like to obtain a profile-independent criterion on which 
function Ci yields the most accurate determination of the cluster center. In fact, as we 
shall see there are two different aspects to this question: one is the accuracy of the center 
position, and the other is the reliability of the result, or its stability. What we mean by 
this remark can be best explained by considering the contour levels of the functions Ci (0) 
and 6*4(0), as plotted in Fig. 9a, for four different cluster profiles. The dashed contour 
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Fig. 9a. Contour lines of the function Ci{0) (dashed contour) and of C4{0) (solid contours) for four 
different clusters. The lensing parameters are the same as those of Fig. 8; the upper left panel has 
ay = 600 km/s, a^c = 1-5, the upper right panel has a^, = 1000 km/s, Re = 1.5; the lower left panel 
has CTt, = 600 km/s, a;c = 0.4, the lower right panel has a^, = 800 km/s, a;c = 0.4. In each case, the 
inner 2x2 arcminutes of the cluster are shown, with the true cluster center at = 0. The crosses in 
each case indicate the galaxy images in the central part of the cluster. The levels of the contours are 
at 60%, 62%, . . . , 98% of the maximum value of the respective function. Note how individual galaxies, 
or small groups of galaxies, have an impact on the local shape of the contours, in particular for those 
corresponding to C4 



correspond to Ci, the solid contours to C4. In all cases displayed, the contours of Ci 
are fairly round, well concentrated around the true center of the cluster, without having 
secondary maxima (except perhaps very shallow ones, close to individual galaxy images). 
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Fig. 9b. Same as in Fig. 9a, except that the contour lines of C2{0) (dashed contours) and C5{0) (solid 
contours) are plotted. Also, the cluster parameters in the four panels are the same as for Fig. 9a 



In contrast, the contours of C4 are much more comphcated, in particular for the clusters 
with small (T„. However, for the cluster in the lower right panel of Fig. 9a, the contours of 
C4 are very well concentrated, and the maximum of C4 lies close to the maximum of Ci . 
By visual inspection of many such contour diagrams, we concluded that the estimate of 
the center from C4 is nearly always more accurate if the contours are 'simple', without 
secondary maxima, and located at those positions where the function Ci is not much 
smaller than its maximum value. On the other hand, in particular for weak clusters, the 
maximum of C4 can be fairly far away from the true lens center. From this we conclude 
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than the determination of the cluster center is more rehable from Ci , but more accurate 
from C4 if the topology of the contours of C4 are simple and concentrated around the 
maximum of Ci. Unfortunately, we cannot provide a more quantitative criterion on 
which of the function Ci should be used in a given situation. Since we have seen above 
that C2 and C5 yield a more accurate center position than Ci and C4, respectively, we 
have also plotted the contour levels of these two functions in Fig. 9b. Similar remarks as 
those to Fig. 9a apply here. The function C2 yields a more reliable result than C5, in the 
sense that the topology of the contours of C2 are 'simple'. The maximum of C5 yields a 
more accurate determination of the cluster center in those cases where the topology of 
its contours are simple and centered near the maximum of C2. 

Finally, we consider the requirement on the lens strength to produce a significant 
image distortion for being detected. Suppose that we suspect a mass concentration at a 
point = (e.g., due to the presence of emission there); from observing galaxy images 
within a circle of radius ^out around this point, we can detect the mass concentration by 
comparing the value of the function C2(0) with the dispersion of this functional value 
caused by an undistorted distribution of galaxy images. Without a lens, we have 

(C2)o = 0;a(C2) = y^V^ , (4.12) 

where M2 is the mean square ellipticity of the galaxies, as defined in (3.18), and N = 
TTO^Q^^. is the number of galaxy images. The mean value of C2(0) in the presence of a 
lens is 

(C2(0)) = J2 (Xti) ~ 27rn f °" d^ 9 {xti) = 2nn f °" dd ({5 (6)) 5 (0) , (4.13) 
i=i Jo Jo 

where in the final step we made use of (3.16), and the function ({5) is plotted in Fig. 1. 
Hence, in order to detect a significant distortion, we require the "signal-to-noise" ratio 



(C2(0)) _ 2^/2 ViV f^° 



n 

f °"' dd d C {S (6)) 5 (9) (4.14) 

Jo 



to be larger than unity. As an illustrative example, we use a singular truncated isothermal 
sphere, i.e., we use (4.7a) with ^c = for ^ < ^tmnc, and k = for ^ > ^tmnc- We then 
have 

^^^^ = 1 -L~+2x- forx<x,_ 

^(^) = 4,2 ^°^ ^ > a^trunc , 

X + ^trunc 

with a^trunc = ^trunc/^0- Setting C(5) Ri 1 - see Fig. 1 - and replacing the lower integration 
limit by ^0 (i-e., avoiding galaxy images within the critical curve of the mass distribution), 
we find for a^trunc ^ 1 

(C2(0)) 2x/2^^o 

^(C2) 

where 

J. "trunc 

^out 
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l + 21n 



is a factor of order unity if ^out is not much larger than ^trunc, but decreases for larger 
^out- In other words, increasing the radius within which galaxies are included in the 
determination of C^ much beyond the truncation radius weakens the signal-to-noise ratio. 
In useful units, we obtain 



'1 



(^^m..,^,jD^\(^.y^_^(mi. . (4,6) 



(j{C2) V A / V300km/sy V 40 V 0.3 

Hence, even isothermal distributions with a velocity dispersion similar to that of massive 
cD galaxies can be detected from distortions, provided they extend to sufficiently large 
radii (i.e., several times their critical radius) and the galaxy density is sufficiently large. 
A similar result has been obtain by ME91. 



5 Reconstruction of circular cluster profiles 

Given a radially-symmetric mass distribution, we want to investigate methods to recon- 
struct the density profile from distorted galaxy images. In this section, three different 
methods will be discussed: in Sect. 5.1, the direct inversion will be treated, whereas 
Sects. 5.2 & 3 deal with fitting methods. Briefiy, the latter are the preferred methods if 
the density of galaxy images is relatively small, whereas the direct inversion yields very 
accurate profiles for large galaxy density. Throughout this section, we will work in terms 
of normalized coordinates x = 0/6o, where 9o is a reference angle; if specific examples 
are treated, we shall use the profile (4.7a) and the relation (4.11) between ^o and the 
asymptotic velocity dispersion a„ of the cluster. 

5.1 Direct inversion 

The basis for the inversion of clusters is provided by equations of the form (4.2) or (4.3). 
For example, combining (4.3) and (4.6), we obtain 

K{x) = -9ix)[l-K{x)]+2J ^ [1 - n{x')] 9{x') . 

By rearranging terms and differentiation with respect to x, this equation can be written 
in the form 

<^ r-, ^ M r-, ^ M r-, ^ m f^9{x) , dg{x) 



[1-k{x)] [l-g{x)] = [1-k{x)] [ 



dx \ X dx 

This differential equation can be readily solved to yield the inversion 

|l_^(a,)| = ^ ^--expf- Hdx' r^^^""'? M I , (5.1) 

' ^ ^' |l-^(^)l V -^^ x'[l-g{x')]J ^ ' 

where a is a constant of integration. The latter one is related to the transformation (3.29), 
which leads g{x) invariant, but which changes 1 — k{x) to A[l — k{x)\. Hence, without 
additional assumptions (e.g., that the surface mass density tends to as a; ^ cx)), this 
integration constant cannot be determined from observations of image deformations. 
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Whereas (5.1) provides an exact reconstruction of the cluster profile in terms of 
g{x), it is useless for practical calculations. The absolute values imply that one has the 
freedom to choose between signs. Furthermore, the denominator in (5.1) vanishes at 
the tangential critical curve, although 1 — k remains finite there. This implies that the 
exponential factor has to compensate for the divergence, which indeed is the case, but 
such a situation will be highly unstable numerically. 

A better way for the inversion is provided by combining (4.2) and (4.6), which yields 

k(x) = -— - I —5- / dx' x' k(x') — gix) I . (5-2) 

1 -g[x) \x^ Jq J 

As we shall see below, the denominator (1 — g) provides some difficulty for applying 
(5.2) near critical curves, but this problem can be circumvented. If we assume for a 
moment that the cluster has no critical curve, then the solution of (5.2) is straightforward. 
Suppose the function g{x) were given for x G [O,a;out]; then, in order to solve (5.2), one 
has to choose a value K(xout), and to calculate 

m{xout) = 2 / dx' x' k{x') = ( [l - ^'(a^out)] '^(a^out) + g{xout)) x^^t ■ 
Jo 

Then, by choosing a grid in x, with Xi = iAx, 1 < i < N, N Ax = a^out, and approxi- 
mating the integral in (5.2) by Simpson's rule, one finds the iteration equations 

_ rrii+i - Ax x,_)-i k,+i - xjgj 
'^"~ (1 - gi)x^ + XiAx ' (5.3) 

rrii = rrii+i - {xiKi + Xi+iK.,+i) Ax , 

with gi = g{xi) etc. This set of iteration equations can be solved readily for a given 
function g{x), and it will produce stable results as long as g does not come close to unity. 
Note that the value of N, or, equivalently. Ax is at our disposal, so we can make the 
iterative solution of (5.2) as accurately as desired. 

The largest problem is of course that the function g{x) can at best be determined 
approximately from observations. The observations yield the tangential alignment Xt of 
the individual galaxy images; from that, local estimates of the distortion S{x) can be 
obtained. Here, we proceed as follows: for each grid point Xi, we average the tangential 
alignment Xt of the galaxy images, weighted with a Gaussian; the width of the Gaussian 
can be adapted to the quality of data, and can also be adjusted to depend on (xt)- This 
average then yields a first estimate of 5i = 5{xi). According to (3.16), we then choose 
^i = (Xt) /C((Xt)), with the approximation (3.22) for the function (^. The value oi gi then 
follows from (3.4) as 

9i = -. , (5.4) 

Oi 

where the minus sign applies for regions in the lens plane where the Jacobian matrix 
has positive determinant, and vice versa (cf. also the discussion below Eq.(3.9)). The 
problem with this prescription of g is of course that the value of the determinant is not 
known a priori; we will discuss the problems related to this further below. 

If we assume that the cluster is not critical, the problem of choosing the correct sign 
in (5.4) does not occur, i.e., we have to take the minus-sign. Fig. 10 shows an example 
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Fig. 10. For a cluster with the mass profile (4.7a), corresponding to a velocity dispersion of a^ = 
1200 km/s and a core radius of 0'5, galaxies of density 40/squarearcmin and with internal ellipticity 
distribution (3.21), with R = 0.3 were mapped within the inner 5 arcminutes; in the realization shown 
here, this amounts to TV = 3174 galaxies. The cluster is non-critical, (a) The dotted curve shows 
the theoretical distortion S{x), with x = 6/6q, whereas the solid curve is the distortion 5 obtained by 
smoothing the tangential alignment profile, as described in the text, (b) The dotted curve is the radial 
mass profile k{x) of the cluster. The solid curve shows the reconstructed mass profile, as obtained 
from (5.3) and the function g{x), related to the distrotion profile 5 by (5.4). The value of K;(a;out) 
was chosen to coincide with the true surface mass density at this point. The dashed curves are also 
reconstructed mass profiles, which are related to the solid curve by the invariance transformation (3.29) 
(for A = —0.3,0.5,0.9, 1.1). The outermost point shown corresponds to 4'.5 



for the reconstruction in the non-critical case. The cluster parameters are indicated in 
the caption, and, as always in this paper, we have chosen D^s/^s = 0.6. The upper 
panel compares the theoretical distortion function 5{x) with the one obtained by the 
preceding smoothing procedure. The amplitude of the variations depend of course on 
the smoothing scale. In the lower panel, we have plotted the true mass profile (dotted 
curve), and the profile obtained by applying (5.3) and (5.4) to the distortion 5 as plotted 
in the upper panel (solid curve). The value of /«(rEout) was chosen such that it agrees 
with the actual value at this point. It can be clearly seen that the relative variations of 
the reconstructed profile are much smaller than the corresponding variations in 5, i.e.. 
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the reconstruction smoothes out noise in the distortion. The dashed curves in Fig. 10b 
also show mass profiles which yield the same distortion profile S{x) as the solid curve, 
and are related to the latter by the invariance transformation (3.29), with four difi'erent 
values of A. It can be seen that both 'reasonable' and 'unreasonable' (negative surface 
mass density - for A ^ 1/(1 — Kout), or profiles which increase outwards - for A < 0) 
mass profiles are compatible with the distortion 5{x). Hence, without further 'external' 
constraints, one cannot distinguish between the various profiles shown in Fig. 10b from 
the distortion data. The mass inside a radius x behaves like 

m{x, A) = Xmo{x) + (1 — X)x (5-5) 

under the transformation (3.29). Besides this basic ambiguity, which is inherent in all 
cluster reconstructions, the reconstructed profile follows the true profile remarkably well. 
Due to the small number of galaxy images close to the center of the cluster, the innermost 
part of the mass profile in not very well constrained; we shall see this to be the case also 
for the critical clusters to be discussed next. 

When the cluster is critical, three essential problems occur. The first is the correct 
choice of the sign in (5.4), the second is the singular behaviour of (5.2) near g = 1, and 
the third is related to the fact that near the radial critical curve, there are at most a few 
galaxy images, so that the profile S{x) near the radial critical curve (and, more generally, 
close to the center) is not well determined from observations. We shall deal with these 
problems in turn. For illustration, we consider, as before, a cluster of the form (4.7) with 
velocity dispersion a„ = 1200 km/s, but this time with a core radius of 0'.2, which implies 
a central surface mass density of kq = 2.07. Since the outer part of the cluster is nearly 
the same as that of Fig. 10, and the reconstruction of the outer parts of the cluster is 
unaffected by the the mass profile in the inner part, we choose an outer radius of 2'.5 in 
which we distribute galaxies with a surface mass density of 40/(arcmin)^. In Fig. 11a, 
we have plotted the true distortion 5 as dashed curve, and the reconstructed distortion 
as obtained from smoothing as the solid curve. It can be seen that the smoothed version 
follows the true curve very closely, with the largest discrepancy occurring close to the 
radial critical curve. This has two reasons: first, the number of galaxy images within the 
radial critical curve is small (in the realization shown in Fig. 11, there are only 10 galaxy 
images with x < 0.5), and second, the distortion 5 shows a sharp minimum at x = Xr, 
which will be very much softened by a smoothing procedure. 

Taking the smoothed profile as it is, we can calculate gi from (5.4). In a first trial, 
we use the minus-sign in (5.4) throughout, which is equivalent of pretending to have no 
critical curve. Fig. lib shows the corresponding reconstructed mass profile in this case. 
Outside the tangential critical curve, the fit is as good as the one shown in Fig. 10, but 
it becomes grossly wrong for x < Xt, as expected, when compared to the true mass 
profile, which is shown as dotted curve. Since the inversion relation (5.2) is singular 
near g = 1, and since the reconstructed distortion profile 5{x) is very close to unity near 
the tangential critical curve, a sharp rise occurs in the reconstructed profile at x = xt. 
Interestingly, the profile reaches a value close to, but smaller than, unity at the center, 
and increases outwards near the center. This is due to the fact that 5 < near the 
center. From (4.4) we see that 5 < can occur either ii k > 1 or F < 0. Since F = R — k, 
F < implies R < k, which can only occur if the k increases with radius. 

It should be noted that the density profile shown in Fig. lib provides a perfect fit to 
the smoothed distortion function and, taken at face value, is an acceptable reconstruction. 
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The reason why one would reject this reconstruction is on physical grounds: the projected 
density of a cluster is not expected to rise with radius. Also, the sharp rise in k at x = xt 
would appear artifical. Since the reconstructed distortion 5 comes very close to unity, 
one could at least suspect that the cluster has a critical curve. 

Thus, we next assume that there is a tangential critical curve near the point where 
the reconstructed distortion profile has its maximum. In this case, we shall try to locate 
the critical curve by fitting a distortion 5{x) near the maximum to the original data. 
This is done by noting that at x = rct, the distortion must locally behave as 

Sfit = l-^{x- xtf + o({x- xtf) . (5.6) 

This behaviour can be obtained from (5.2) by expanding k and g around Xt and noting 
that m{xt) = 1; this then yields that dg/dx = — 2/a;t at a: = Xt. Inserting this result into 
(3.4) then yields (5.6). We searched the location of the critical curve in the following 
way: for each trial value of xt we have taken the three nearest galaxy images on both 
sides and fitted the distortion (5.6) to these images. The value of xt chosen corresponds 
to the minimum of the error 

where the sum extends of the six nearest galaxy images, xa are their tangential align- 
ments, and (T^{5) is defined in (3.17) and plotted in Fig. 2; for the fit, we used the 
approximation a^, as given in (3.23). With this procedure, the location of the tangential 
critical curve can be accurately determined. Due to the small number of galaxy images 
near the radial critical curve, a similar procedure near x,- is not practical. We estimated 
Xy. simply as the minimum of the smoothed distortion 5. 

Having determined xt and x,-, we can now determine the sign to be used in (5.4), 
i.e., we take the minus-sign for x > Xt and x < x,., and the plus-sign for x,. < x < Xt. 
With this choice of gi, we have applied the recursion relations (5.3) and obtained the 
radial mass profile shown in Fig. lie. One can see that this profile is even worse than 
that shown in Fig. lib, and the reason for this can be easily understood. At the critical 
curves, the sign in (5.4) changes, which implies that g changes from values l^^l < 1 to 
\g\ > 1. Since ^ is a very sensitive function of 5 for \5\ close to unity, this change is 
related to a large change in g across the critical curves, even if |5| comes very close to 
unity, at least near Xt. To illustrate this quantitatively for the case shown in Fig. 11, 
the maximum of the smoothed distortion is 5 = 0.98885, so that g changes from 0.86 
to 1.16 across the tangential critical curve. Even more drastic is the jump across the 
radial critical curve from —0.53 to —1.90, with the minimum of 5 being —0.825. These 
jumps of g cause the discontinuities of the reconstructed profile in Fig. lie. Combined 
with the singular behaviour of (5.2) near xt, the resulting profile even leads to negative 
values of k near the center of the cluster (not shown in the figure). But again we note 
that the profile in Fig. lie provides a perfect fit to the smoothed distortion 5{x), so that 
this profile must again be rejected on physical grounds. Here, the negative values of k 
are unacceptable, as well as the discontinuities in k. Additionally, plotting g{x), the 
discontinuities are very obvious, and sufficient to reject the final result. In fact, for every 
given distortion function 5(x), one can obtain an infinite number of mass profiles which 
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provide exact solutions of (5.2); two of them are shown in Figs. llb,c. To generate more, 
one can just arbitrarily choose a change of sign in (5.4) at any x\ the resulting profiles 
are extremely badly behaved, in that they have discontinuities and negative surface mass 
density, but they are perfect fits to the original distortion function. 



2.5 




Fig. 11. For the same cluster model as in Fig. 10, except that now the core radius is 0'2, galaxies with the 
density of 40/arcmin and the internal ellipticity distribution (3.21) with R = 0.3 were mapped within 
in the inner 2'5. In the realization shown, this leads to a total oi N = 784 galaxy images. The cluster 
is critical, (a) The true distortion S{x) (dashed curve), the smoothed reconstructed distortion (solid 
curve), and the fitted distortion (dotted curve), which deviates from the smoothed distortion near the 
critical curves, (b) Solid curve shows the reconstructed density profile k{x), obtained from the smoothed 
distortion by assuming a negative sign in (5.4) for all x. (c) Reconstructed density profile from the 
smoothed distortion, with the correct sign in (5.4). (d) Reconstructed density profile from the smoothed 
distortion, but with a fit procedure near the critical curves (for more details, see text). In panels (b)— (d), 
the dotted curve is the true density profile. In all reconstructions, the outermost point was fixed to agree 
with the true density, to fix the transformation freedom (3.29) 

In order to deal with the difficulties encountered near the critical curve, we have used 
a fitting procedure for the distortion near Xt and x^-. We proceed in the following way: 
let xi and X2 be two radial coordinates with a^i < Xt < a;2; in the interval a^i < a; < a;2, 
we expand the mass m{x) as a Taylor series. 



m{x) ^ mtit{x) = xl + ^Joi (x - XtY 



(5.8) 



i=i 



This ansatz automatically satisfies (5.7). The three coefficients Oj are determined by the 
conditions 
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"^fit(a:2) 


= m{x2) , 




dmfit(a;2) 
da: 


= 2X2k{x2) 


J 


dmfit(a;i) 1 - g{xi) 
dx 2a:i 


rrifitixi) 
- xl 


- 9{xi) 



(5.9) 



where the third condition follows from (5.2) at xi, and m(x2) and k(x2) are determined 

from the inward iteration (5.3). Similarly, let x^ < x,. < 0:4 be two coordinates on either 

side of the radial critical curve; then, the surface mass density for a:3 < re < 0:4 is fitted 

by 

3 

k{x)^ Kfit{x) = ^biX^' , (5.10) 

i=0 

and the four coefficients bi are obtained from the conditions 

1 + Kfit(xr) - 2Kfit(a:r) = , 

Kfit{x4) = k{x4) , 

rrifitix^) = m{x4) , (5.11) 

Km{x3)[i^ - gixs)] = ^% ^ -.6^(3:3) , 

where mfit and Rat are obtained from K^t via Eqs.(4.1 & 3). The conditions (5.9) and 
(5.11) lead to algebraic equations for the coefficients, which can be inverted analytically. 
Thus, within the intervals xi < a: < 3:2 and xs < x < X4, we use the approximation 
formulae (5.8 & 10) instead of (5.3). The dotted curve in Fig. 11a shows the corresponding 
equivalent distortion 5 in these intervals, whereas Fig. lid displays the reconstructed 
density profile, using this fitting method. Except very close to the center, where the 
distortion information is sparse, we now obtain a very satisfactory fit for all radii. The 
result now depends on the values oi Xj, 1 < j < 4 for the fitting intervals; they have been 
chosen 'by eye' to get a fairly smooth connection of the fitted distortion to the smoothed 
distortion curve. If the interval is badly chosen, the reconstructed density profile will 
again show nearly discontinuous behaviour. 

Whereas one can probably invent fitting formulae near the critical curves which are 
more sophisticated that (5.8 & 10), it is probably not worth the effort: the more advanced 
the fitting, and the larger the fitting intervals, the more the method deviates from a true 
direct inversion technique. Since the function 5 has sharp features, it is much easier to 
obtain fits for k directly than fitting 5 and determine k, from this fit. Thus, we now turn 
to fitting methods. 

5.2 Fitting by piecewise linear profiles 

A simple way to fit a mass profile to the galaxy ellipticities is the following: select n points 
Xj and assume that the profile can be described, to sufficient accuracy, by a piecewise 
linear function, between Xj and Xj^i. Then, the mass profile is completely determined 
by the values k{xj), 1 < j < n. In order to determine the k{xj), one can minimize the 
error measure 



^-E rt^lf'n (5-12) 
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with respect to the k{xj), where the function <^(5) and (t^{5) have been discussed in 
Sect. 3, and 5{xi) is the value of the distortion calculated from the piecewise linear mass 
profile. One way to achieve this minimization is by the Levenberg-Marquardt method, 
as described in Press et al. (1992); we used the implementation mrqmin as described in 
this book. The only difference to the standard x^ minimization is that the dispersion cr^ 
is described by the model, and not by the data. So the cr^'s have to be calculated from 
the current mass model at each step of the iteration. 

For simplicity, we have chosen an equidistant grid in x, with Xj = {j — l)a:out/'^, 
and have ^(a^out) chosen to coincide with the true value, in order to fix the freedom of 
transformation (3.29). The quality of the fit can depend sensitively on the initial guess 
for the profile; we have taken as a starting model the k(xj) from the profile shown as 
solid curve in Fig. lid, to fit the same galaxy data. Within a^out, there are N = 632 
galaxy images (note that a^out corresponds to 0.9 x 2'.5), so a good fit should have a x^ 
of about N/2 — n (the factor (1/2) is due to the fact that a^ was calculated for both 
components of S, whereas in the axially-symmetric case, only one component is fitted). 
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Fig. 12. For the same cluster and galaxy data as that used for Fig. 11, piecewise linear models for k are 
shown, for several values of the number n of grid points 



We have used this procedure to obtain profile fits for the same data on which Fig. 11 
is based, for the values 4 < n < 20. For each n, we have calculated the resulting value 
of x^, which can change drastically with n, not necessarily monotonically. If we insert 
the distortion of the underlying mass profile in (5.12), we obtain a value of x^ ~ 289. 
For four different values of n, the profiles are plotted in Fig. 12. Both, from the figure 
and the corresponding values of x^, we see that the fit improves as we go from n = 5 to 
n = 10, but does not improve very much for larger n. In fact, choosing larger values of 
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n makes the profile to follow the fluctuations in the values of Xt of the galaxy images, 
as seen by the fluctuating profiles for n = 15, 20. In fact, even if the underlying profile 
were not known, one would still accept the n = 10 fit to be better than the others shown. 
All curves cross the underlying profile at around k = 1, so this point provides a strong 
constraint on the fit. 



5.3 Fitting by other functions 

The procedure described in the foregoing subsection can be generalized by fitting 
parametrized profiles to the ellipticity data. The piecewise linear profile has the ad- 
vantage that no prejudice enters the fit, whereas a parametrized mass profile always 
implicitly makes assumptions about the true profile. On the other hand, the error mea- 
sure (5.12) can control whether the best-fit model provides a sufficiently good description 
for the ellipticity data. 

We choose here a simple example to show how the method works. Consider the 
profile fit 



^i^) = Yl 



/^ (62 



x^ 



(5.13) 



this fit formula contains 3n free parameters. We have fitted the data used for Figs. 11 
and 12 with (5.13), with n = 3, again by minimizing (5.12) with the help of the routine 
mrqmin in Press et al. (1992). The result is shown in Fig. 13. Note that in this case, 
the parameter A in the invariance transformation (3.29) has not been fixed; instead, the 
smooth power-law behaviour of the fit formula (5.13) for x ^ bj effectively fits a (sum 
of) power-laws to the data at large radius. 
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Fig. 13. For the same cluster and galaxy data as that used for Figs. 11 and 12, the parametrized profile 
(5.13) with n = 3 was fitted to the ellipticity data. The resulting fit is shown as dotted curve, whereas the 
underlying mass profile is shown as solid curve. The value of x is 285, whereas if the true distortion 8 is 
used in (5.12), the corresponding value is 289. The resulting fit parameters are ai = (0.441, 0.053, 0.150), 

b, = (0.411,0.297,0.679), ai = (0.361,0.850,2.282) 
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As seen from Fig. 13, the reconstructed profile matches the underlying mass distri- 
bution very well. Although the latter one can be represented exactly with the fit formula 
(5.13) for n = 2 (oi = 02 = 1/2, bi = b2 = x^ = 0.482, ai = 1/2, 0:2 = 3/2), the 
minimization yields quite different values of these parameters (although the final result 
will depend on the initial choice of model parameters). The fit at larger radii can be 
improved by taking data in a larger field. 

5.4 Discussion 

The three different methods discussed in this section have their strengths and weak- 
nesses. Certainly the strength of the direct inversion is that it makes no assumption 
about the underlying mass profile, although, as pointed out several times, there remains 
an arbitrariness with respect to (1) choosing the parameter A in the invariance transfor- 
mation (3.29) and (2) the local choice of the sign in (5.4) due to the degeneracy between 
g and 1/g. An additional advantage of the direct method is that the intrinsic elliptic- 
ity distribution of the sources need not be known, whereas for the fitting method, the 
function cr^(5) must be known, which depends onps(x'^')- However, for critical clusters, 
the direct inversion method has problems near the critical curves, so we were forced to 
introduce a local fitting of S (or k) close to them in order to obtain the satisfactory fit 
shown in Fig. lid. These problems do not occur in the fitting procedures dealt with 
in Sects. 5.2 & 5.3, though they are burdened with different problems. Certainly, the 
method of Sect. 5.3 uses a prejudice about the general shape of the surface mass density, 
which can however be justified a posteriori by considering the value of x^ oi the final 
result. The piecewise linear profiles treated in Sect. 5.2 do not introduce such a prejudice. 
Both fitting methods have the technical problem that the minimization procedure can 
locate local minima, where they get stuck, or where they run into a parameter regime 
where the Hessian matrix of x^ with respect to the fitting parameters becomes singular. 
That means that the initial choice of parameters have to be tried out, and many different 
trials may be necessary before the solution converges to an acceptable fit. 

Of course, combinations of these methods are possible. For example, the direct 
inversion proceeds from large x to smaller ones; hence, it can be used without problems 
from a^out to a value of x close to the first critical curve (and right to the center if the 
cluster is not critical), and the inner part can be fitted by a parametrized mass model. 
Whereas in the one-dimensional case considered here the direct inversion yields results 
comparable with model fitting, it remains to be seen which of these methods is the more 
successful one in a realistic two-dimensional case. We shall deal with that problem in a 
forthcoming paper of this series (Seitz & Schneider, in preparation). 



6 Conclusions 

Progress in cluster inversion is currently driven by improvements in the observational 
techniques; they have already achieved a level where significant information about the 
general mass profiles of clusters can be obtained (Fahlman et al. 1994; Small et al. 
1994a, b; Bonnet et al. 1994). Future observational progress, such as larger CCD arrays, 
larger telescopes with improved optics etc., can be expected and will make the cluster 
inversion from gravitational distortions the most reliable method to obtain information 
about the matter distribution in clusters of galaxies. Furthermore, the refurnished Hubble 
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Space Telescope will provide high-resolution images of distant faint galaxies, so that their 
intrinsic shape distribution can be expected to be determined, or at least to be very much 
constrained. With these observational prospects in mind, we want to initiate a program 
for the theoretical analysis of such improved data. The first step, cluster inversion in the 
linear regime, has been undertaken in the pioneering work by Kaiser & Squires (1993). 
In a forthcoming paper, we will show that even in the linear regime, the original Kaiser 

6 Squires method can be substantially improved; in particular, the noise can be reduced. 

The present paper should be considered as a second step, in which we build the 
ground for nonlinear inversion techniques. We have concentrated on the simplest possible 
case, namely assumed that all sources have the same redshift; this allows us to concentrate 
on the essential properties of the statistical estimate of the local distortion caused by the 
lens. Our main results can be summarized as follows: 

(1) There is a global invariance transformation (3.29) of the surface mass density 
which leaves the observable distortion unchanged; this transformatin corresponds to 
adding a uniform sheet of matter and simultaneously scaling the density profile, and 
is similar to the 'magnification transformation' found by Gorenstein et al. (1988) in a 
different context. 

(2) There is a local invariance transformation (3.3), which can also be written as 

7 -^ 7(1 — k)'^/ I7I , where 7 is the (complex) shear. 

(3) We have obtained the relation between the local distortion 5 and the local average 
of the image ellipticities (3.16). Depending on the intrinsic shape distribution of the 
sources, the correction factor Cd^l) can deviate significantly from unity. We have provided 
simple fit formulae for this correction factor, as well as for the dispersion of 5. 

(4) The local distortion 5 can also be determined from the orientation of the galaxy 
images only, without using the absolute value of their ellipticity. At least for small 
distortions, this method yields results which are nearly comparable to the one in which 
full information about the image shapes is used (see Fig. 5). Refinements of this technique 
should further reduce the corresponding error. 

(5) Knowing the distribution of intrinsic source shapes allows the determination of 
the dispersion achi of the local shear. This fact has been used several times in the fitting 
procedures to obtain the mass profile of spherical clusters. On the other hand, the general 
shape of the function cr^(5) should not be very different from that shown in Fig. 2, even 
for a quite different distribution in intrinsic source shapes. 

(6) We have investigated several methods to determine the center of an axially- 
symmetric matter distribution. For non-critical clusters, the accuracy with which the 
center can be determined is about 1/3 of the mean separation of the galaxy images; 
however, for critical clusters, the accuracy can be improved to a median value of less 
than one tenth of the mean image separation (see Fig. 8) . 

(7) We have obtained a fully nonlinear inversion of clusters with axially-symmetric 
surface mass density in Sect. 5. Whereas the nonlinear aspects have not produced any 
difficulty for clusters without critical curves, they cause substantial problems for the in- 
ner parts of critical clusters. The symmetric case considered here also provides a useful 
demonstration for the problems one encounters because of the local invariance transfor- 
mation mentioned in (2) above. These problems also indicate the difficulties one has to 
anticipate if a truely two-dimensional matter distribution is to be reconstructed from the 
distortion field. An infinite number of matter distributions are compatible with a given 
set of galaxy ellipticities; these distributions differ by different local choices of the local 
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invariance transformation (2) and the overall constant A in the global invariance trans- 
formation (3.29). Concerning the first of these, the favoured mass profile can be singled 
out only by physical or plausibility arguments (such as the requirement of continuity and 
smoothness of the density profile) . 

We expect that the local invariance (2) can be partially broken if a distribution 
of sources with redshift is considered. However, close to the critical curves where this 
invariance causes the largest problems, one probably cannot hope that an additional 
degree of freedom (source redshift) will ease the reconstruction of the mass profile. From 
image distortions alone, there is no way to break the global invariance transformation 
(3.29). However, the recently suggested method based on spectroscopy of the brighter 
galaxy images, to compare the joint distribution in redshift and magnitude of galaxies 
behind a cluster and in the field (Broadhurst, Taylor & Peacock 1994) can break the 
degeneracy related to this invariance; hence, this latter approach can possibly provide at 
least very useful supplementary information for cluster inversion. 
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